3 <110> APPLICANT: DING, SHI-YOU

4 ADNEY, WILLIAM S.

5 VINZANT, TODD B.

6 DECKER, STEPHEN R.
7 HIMMEL, MICHAEL E.

9 <120> TITLE OF INVENTION: THERMAL TOLERANT CELLULASE FROM ACIDOTHERMUS 

10 CELLULOLYTICUS . 

12 <130> FILE REFERENCE : U0170 . 6U S0l) 
C--> 14 <140> CURRENT APPLICATION NUMBER: US/09/917,384
15 <141> CURRENT FILING DATE: 2001-07-28
17 <160> NUMBER OF SEQ ID NOS: 14 0
19 <170> SOFTWARE: PatentIn Ver. 2.1

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 1228 

23 <212> TYPE: PRT 

24 <213> ORGANISM: Artificial Sequence 

26 <220> FEATURE:

27 <223> OTHER INFORMATION: Description of Artificial Sequence: Segment of 

28 GuxA 

30 <400> SEQUENCE: 1 

31 Met Glu Arg Thr Gln Gln Ser Gly Arg Asn Cys Arg Tyr Gln Arg Gly

32 1 5 10 15

34 Thr Thr Arg Met Pro Ala Ile Ser Lys Arg Leu Arg Ala Gly Val Leu
35 20 25 30

37 Ala Gly Ala Val Ser Ile Ala Ala Ser Ile Val Pro Leu Ala Met Gln

38 35 40 45

40 His Pro Ala Ile Ala Ala Thr His Val Asp Asn Pro Tyr Ala Gly Ala

41 50 55 60

43 Thr Phe Phe Val Asn Pro Tyr Trp Ala Gln Glu Val Gln Ser Glu Ala
44 65 70 75 80

46 Ala Asn Gln Thr Asn Ala Thr Leu Ala Ala Lys Met Arg Val Val Ser
47 85 90 95

49 Thr Tyr Ser Thr Ala Val Trp Met Asp Arg Ile Ala Ala Ile Asn Gly

50 100 105 110 

52 Val Asn Gly Gly Pro Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser 

53 115 120 125 

55 Gln Gln Gln Gly Thr Thr Pro Glu Val Ile Glu Ile Val Ile Tyr Asp

56 130 135 140

58 Leu Pro Gly Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Pro

59 145 150 155 160

61 Ala Thr Ala Ala Gly Leu Gln Thr Tyr Glu Thr Gln Tyr Ile Asp Pro

62 165 170 175

64 Ile Ala Ser Ile Leu Ser Asn Pro Lys Tyr Ser Ser Leu Arg Ile Val
65 180 185 190

67 Thr Ile Ile Glu Pro Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser

68 195 200 205

70 Ile Gln Ala Cys Ala Thr Ala Val Pro Tyr Tyr Glu Gln Gly Ile Glu

71 210 215 220

73 Tyr Ala Leu Thr Lys Leu His Ala Ile Pro Asn Val Tyr Ile Tyr Met

74 225 230 235 240 

76 Asp Ala Ala His Ser Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly 

77 245 250 255 

79 Tyr Val Gln Glu Val Gln Lys Val Leu Asn Ala Ser Ile Gly Val Asn

80 260 265 270

82 Gly Ile Asp Gly Phe Val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys

83 275 280 285

85 Glu Pro Phe Met Thr Ala Thr Gln Gln Val Gly Gly Gln Pro Val Glu

86 290 295 300

88 Ser Ala Asn Phe Tyr Gln Trp Asn Pro Asp Ile Asp Glu Ala Asp Tyr

89 305 310 315 320 

91 Ala Val Asp Leu Tyr Ser Arg Leu Val Ala Ala Gly Phe Pro Ser Ser 

92 325 330 335 

94 Ile Gly Met Leu Ile Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn

95 340 345 350 

97 Glu Pro Thr Gly Pro Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn 

98 355 360 365 

100 Gln Ser Lys Ile Asp Leu Arg Gln His Arg Gly Leu Trp Cys Asn Gln

101 370 375 380

103 Asn Gly Ala Gly Leu Gly Gln Pro Pro Gln Ala Ser Pro Thr Asp Phe

104 385 390 395 400

106 Pro Asn Ala His Leu Asp Ala Tyr Val Trp Ile Lys Pro Pro Gly Glu

107 405 410 415 

109 Ser Asp Gly Thr Ser Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser 

110 420 425 430 

112 Asp Pro Met Cys Asp Pro Thr Tyr Thr Thr Ser Tyr Gly Val Leu Thr 

113 435 440 445 

115 Asn Ala Leu Pro Asn Ser Pro Ile Ala Gly Gln Trp Phe Pro Ala Gln

116 450 455 460

118 Phe Asp Gln Leu Val Ala Asn Ala Arg Pro Ala Val Pro Thr Ser Thr

119 465 470 475 480

121 Ser Ser Ser Pro Pro Pro Pro Pro Pro Ser Pro Ser Ala Ser Pro Ser 

122 485 490 495 

124 Pro Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 

125 500 505 510 

127 Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 

128 515 520 525 

130 Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 

131 530 535 540 

133 Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser 

134 545 550 555 560 

136 Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser 

137 565 570 575 
139 Pro Ser Pro Thr Ser Ser Pro Val Ser Gly Gly Leu Lys Val Gln Tyr

140 580 585 590

142 Lys Asn Asn Asp Ser Ala Pro Gly Asp Asn Gln Ile Lys Pro Gly Leu

143 595 600 605

145 Gln Leu Val Asn Thr Gly Ser Ser Ser Val Asp Leu Ser Thr Val Thr
146 610 615 620

148 Val Arg Tyr Trp Phe Thr Arg Asp Gly Gly Ser Ser Thr Leu Val Tyr
149 625 630 635 640

151 Asn Cys Asp Trp Ala Ala Met Gly Cys Gly Asn Ile Arg Ala Ser Phe

152 645 650 655

154 Gly Ser Val Asn Pro Ala Thr Pro Thr Ala Asp Thr Tyr Leu Gln Leu

155 660 665 670

157 Ser Phe Thr Gly Gly Thr Leu Ala Ala Gly Gly Ser Thr Gly Glu Ile

158 675 680 685

160 Gln Asn Arg Val Asn Lys Ser Asp Trp Ser Asn Phe Thr Glu Thr Asn

161 690 695 700

163 Asp Tyr Ser Tyr Gly Thr Asn Thr Thr Phe Gln Asp Trp Thr Lys Val

164 705 710 715 720
166 Thr Val Tyr Val Asn Gly Val Leu Val Trp Gly Thr Glu Pro Ser Gly
167 725 730 735 

169 Thr Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 

170 740 745 750 

172 Pro Gly Gly Asp Val Thr Pro Pro Ser Val Pro Thr Gly Leu Val Val 

173 755 760 765 

175 Thr Gly Val Ser Gly Ser Ser Val Ser Leu Ala Trp Asn Ala Ser Thr 

176 770 775 780 

178 Asp Asn Val Gly Val Ala His Tyr Asn Val Tyr Arg Asn Gly Val Leu 

179 785 790 795 800 

181 Val Gly Gln Pro Thr Val Thr Ser Phe Thr Asp Thr Gly Leu Ala Ala

182 805 810 815 

184 Gly Thr Ala Tyr Thr Tyr Thr Val Ala Ala Val Asp Ala Ala Gly Asn 

185 820 825 830 

187 Thr Ser Ala Pro Ser Thr Pro Val Thr Ala Thr Thr Thr Ser Pro Ser 

188 835 840 845 

190 Pro Ser Pro Thr Pro Thr Gly Thr Thr Val Thr Asp Cys Thr Pro Gly 

191 850 855 860 

193 Pro Asn Gln Asn Gly Val Thr Ser Val Gln Gly Asp Glu Tyr Arg Val

194 865 870 875 880

196 Gln Thr Asn Glu Trp Asn Ser Ser Ala Gln Gln Cys Leu Thr Ile Asn

197 885 890 895 

199 Thr Ala Thr Gly Ala Trp Thr Val Ser Thr Ala Asn Phe Ser Gly Gly 

200 900 905 910 

202 Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser Ile Tyr Lys Gly Cys His

203 915 920 925

205 Trp Gly Asn Cys Thr Thr Lys Asn Val Gly Met Pro Ile Gln Ile Ser

206 930 935 940

208 Gln Ile Gly Ser Ala Val Thr Ser Trp Ser Thr Thr Gln Val Ser Ser

209 945 950 955 960
211 Gly Ala Tyr Asp Val Ala Tyr Asp Ile Trp Thr Asn Ser Thr Pro Thr
212 965 970 975

214 Thr Thr Gly Gln Pro Asn Gly Thr Glu Ile Met Ile Trp Leu Asn Ser

215 980 985 990

217 Arg Gly Gly Val Gln Pro Phe Gly Ser Gln Thr Ala Thr Gly Val Thr

218 995 1000 1005

220 Val Ala Gly His Thr Trp Asn Val Trp Gln Gly Gln Gln Thr Ser Trp

221 1010 1015 1020

223 Lys Ile Ile Ser Tyr Val Leu Thr Pro Gly Ala Thr Ser Ile Ser Asn

224 1025 1030 1035 1040

226 Leu Asp Leu Lys Ala Ile Phe Ala Asp Ala Ala Ala Arg Gly Ser Leu

227 1045 1050 1055

229 Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala Gly Phe Glu Ile Trp

230 1060 1065 1070

232 Gln Gly Gly Gln Gly Leu Gly Ser Asn Ser Phe Ser Val Ser Val Thr

233 1075 1080 1085 

235 Ser Gly Thr Ser Ser Pro Thr Pro Ser Pro Ser Pro Thr Pro Thr Pro
236 1090 1095 1100

238 Ser Pro Thr Pro Thr Pro Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro

239 1105 1110 1115 1120 

241 Thr Ser Ser Pro Ser Ser Ser Gly Val Ala Cys Arg Ala Thr Tyr Val 

242 1125 1130 1135 

244 Val Asn Ser Asp Trp Gly Ser Gly Phe Thr Ala Thr Val Thr Val Thr 

245 1140 1145 1150 

247 Asn Thr Gly Ser Arg Ala Thr Asn Gly Trp Thr Val Ala Trp Ser Phe 

248 1155 1160 1165 

250 Gly Gly Asn Gln Thr Val Thr Asn Tyr Trp Asn Thr Ala Leu Thr Gln
251 1170 1175 1180

253 Ser Gly Ala Ser Val Thr Ala Thr Asn Leu Ser Tyr Asn Asn Val Ile

254 1185 1190 1195 1200
256 Gln Pro Gly Gln Ser Thr Thr Phe Gly Phe Asn Gly Ser Tyr Ser Gly
257 1205 1210 1215

259 Thr Asn Ala Ala Pro Thr Leu Ser Cys Thr Ala Ser
260 1220 1225 

263 <210> SEQ ID NO: 2 

264 <211> LENGTH: 3687 

265 <212> TYPE: DNA 

266 <213> ORGANISM: Artificial Sequence 

268 <220> FEATURE: 

269 <223> OTHER INFORMATION: Description of Artificial Sequence: Segment of 

270 GuxA 

272 <400> SEQUENCE: 2 

273 atggagcgaa cccaacaatc cggacggaac tgcaggtacc agagaggaac gacacgaatg 60 

274 cccgccatct caaaacggct gcgagccggc gtcctcgccg gggcggtgag catcgcagcc 120 

275 tccatcgtgc cgctggcgat gcagcatcct gccatcgccg cgacgcacgt cgacaatccc 180 

276 tatgcgggag cgaccttctt cgtcaacccg tactgggcgc aagaagtaca gagcgaagcg 24 0 

277 gcgaaccaga ccaatgccac tctcgcagcg aaaatgcgcg tcgtttccac atattcgacg 300 

278 gccgtctgga tggaccgcat cgctgcgatc aacggcgtca acggcggacc cggcttgacg 360 

279 acatatctgg acgccgccct ctcccagcag cagggaacca cccctgaagt cattgagatt 420 

280 gtcatctacg atctgccggg acgcgactgc gcggcgctcg cctccaacgg cgaactgccc 480 
281 gctacggcag caggtttgca gacctatgaa acgcagtaca tcgatccgat tgcgagtatc 540

282 ctgagcaatc cgaagtactc cagcctgcgg atcgtgacga tcattgagcc ggactcgctg 600

283 ccaaacgcgg tcaccaatat gagcattcaa gcgtgtgcaa cggcggtgcc gtattacgag 660

284 caaggcatcg agtacgcgct cacgaaattg cacgccattc cgaacgtgta catctacatg 720

285 gacgccgccc actccggctg gcttgggtgg cccaataatg ccagcggata cgtacaggaa 780

286 gtccagaagg tcctcaacgc gagcatcggg gtcaacggca tcgacggctt cgtcaccaac 840

287 acggcgaatt acacgccgtt gaaggagccg ttcatgaccg ccacccagca ggtcggcgga 900

288 cagccggtgg agtcggcgaa tttctaccag tggaatcctg acatcgacga agccgactac 960

289 gcggttgact tgtactcgcg gctcgtcgcc gctggctttc caagcagcat cggcatgctc 1020

290 atcgacacct tacgcaacgg ttggggtggt ccgaacgaac caacaggccc gagcaccgcg 1080

291 accgatgtca acaccttcgt caaccagtcg aagattgacc ttcggcagca ccgcggcctg 1140

292 tggtgcaacc agaacggtgc gggcctcggc cagccgccgc aggcaagccc gacggacttc 1200

293 ccgaacgcgc acctcgacgc gtatgtctgg atcaagccgc cgggtgagtc ggacggcaca 1260

294 agcgctgcga gcgatccgac aactggcaag aagtcggacc ccatgtgcga cccgacgtac 1320

295 acgacgtcgt acggggtact gaccaacgcg ttaccgaact ccccgatcgc cggccagtgg 1380

296 ttcccggcgc agtttgacca gcttgtcgcg aacgcacggc cagcggtgcc gacgtcgacc 1440

297 agctcgagcc cgccgcctcc gccgccgagt ccgtcggctt cgccgagtcc gagcccgagt 1500

298 ccgagcccga gcagctcgcc atcgccgtcg ccgtctccga gctcgagccc gtctccgtcg 1560

299 ccgagcccga gtccgagccc gagtagctcg ccgtcgccgt ctccgagctc gagcccgtct 1620

300 ccgtcgccga gcccgagtcc gagcccgagt agctcgccgt cgccgtctcc gagctcgagc 1680

301 ccgtctccgt cgccgagccc gagtccgagc ccgagtagct cgccgtcgcc gtctccgacg 1740

302 tcgtcgccgg tgtcgggtgg gctgaaggtg cagtacaaga acaatgattc ggcgccgggt 1800

303 gataaccaga tcaaaccggg tctccagttg gtgaataccg ggtcgtcgtc ggtggatttg 1860

304 tcgacggtga cggtgcggta ctggttcacc cgggatggtg ggtcgtcgac actggtgtac 1920

305 aactgtgact gggcggcgat ggggtgtggg aatatccgcg cctcgttcgg ctcggtgaac 1980

306 ccggcgacgc cgacggcgga cacctacctg cagttgtcgt tcactggtgg aacgttggcc 2040

307 gctggtgggt cgacgggtga gattcaaaac cgggtgaata agagtgactg gtcgaatttc 2100

308 accgagacca atgactactc gtatgggacg aacaccacct tccaggactg gacgaaggtg 2160

309 acggtgtacg tcaacggcgt gttggtgtgg gggactgaac cgtccggcac cagccccagc 2220

310 cccacaccat ccccgagccc gagcccgagc ccgagcccgg gtggggatgt gacgccgccg 2280

311 agtgtgccga ccggcttggt ggtgacgggg gtgagtgggt cgtcggtgtc gttggcgtgg 2340

312 aatgcgtcga cggataacgt gggggtggcg cattacaacg tgtaccgcaa cggggtgttg 2400

313 gtgggccagc cgacggtgac ctcgttcacc gacacgggtt tggccgcggg aaccgcgtac 2460

314 acctacacgg tggccgcggt ggacgctgcg ggtaacacct ccgccccatc cacccccgtc 2520

315 accgccacca ccacgagtcc cagccccagc cccacgccga cggggaccac ggtcaccgac 2580

316 tgcacgcccg gtcctaacca gaatggtgtg accagcgtgc agggcgacga ataccgggtg 2640

317 cagaccaatg agtggaattc gtcggcccag cagtgcctca ccatcaatac cgcgaccggt 2700

318 gcctggacgg tgagcactgc gaacttcagc ggtgggaccg gcggtgcgcc cgcgacgtat 2760

319 ccgtcgatct acaagggctg ccactggggc aactgcacca cgaagaacgt cgggatgccg 2820

320 attcagatca gtcagattgg ttcggctgtg acgtcgtgga gtacgacgca ggtgtcgtcg 2880

321 ggcgcgtatg acgtggccta cgacatttgg acgaacagta ccccaacgac aaccggtcag 2940

322 ccaaacggta ccgaaatcat gatttggctg aattcgcgtg gtggggtgca gccgttcggg 3000

323 tcgcagacag cgacgggtgt gacggtcgct ggtcacacgt ggaatgtctg gcagggtcag 3060

324 cagacctcgt ggaagattat ttcctacgtc ctgacccccg gtgcgacgtc gatcagtaat 3120

325 ctggatttga aggcgatttt cgcggacgcc gcggcacgcg ggtcgctcaa cacctccgat 3180

326 tacctgctcg acgttgaggc cgggtttgag atctggcaag gtggtcaggg cctgggcagc 3240

327 aactcgttca gcgtctccgt gacgagcggc acgtccagcc cgacaccgag cccgagcccg 3300

328 acgccgacac cgagcccgac gccgacaccg tctccgagcc cgaccccgtc gccgagtccg 3360

329 accagctcgc cgtcgtcgtc gggtgtggcg tgccgggcga cgtatgtggt gaatagtgat 3420


VERIFICATION SUMMARY DATE: 01/29/2002

PATENT APPLICATION: US/09/917,384 TIME: 12:17:20 

Input Set : A:\4017us01.app 

Output Set: N:\CRF3\01292002\I917384.raw 



L:14 M:270 C: Current Application Number differs, Replaced Current Application Number 